Steganalysis of Digital Images Using Rich Image Representations and Ensemble Classifiers By
نویسنده
چکیده
Modern detectors of steganographic communication in digital images are implemented as supervised classifiers trained in pre-defined feature spaces also called image models. Currently, the Support Vector Machine (SVM) is the machine-learning tool of choice in the steganalysis community due to its accuracy and a well-founded theory. However, in order to keep the SVM training computationally feasible, feature spaces need to be designed to be low-dimensional. Consequently, their construction often consists of a series of clever tricks and heuristic dimensionality-reduction techniques. Recent trends in steganalysis, however, have shown that more complex and higher-dimensional image models could deliver substantially better performance. In this dissertation, we propose a novel framework for steganalysis of digital images in which we replace SVMs with the ensemble classifier, a scalable machine-learning alternative offering comparable accuracy at a fraction of a computational cost. This allows us to approach the feature-space building process in a more systematic and exhaustive way. In particular, we propose to construct feature spaces as collections of a large number of simpler submodels, each of them capturing different types of dependencies among image coefficients. As a result, we obtain a high-dimensional rich statistical descriptor of images, the so-called rich model, which is then used for the classifier training. To demonstrate the power of the proposed methodology, we construct rich models for images in raster formats and JPEGs, the two most commonly used image representations. The 34, 671-dimensional spatial domain rich model consists of co-occurrences of neighboring samples of noise residuals obtained by various filters, and the 22, 510-dimensional JPEG domain rich model consists of submodels capturing different types of dependencies among DCT coefficients of JPEG images. Both rich models, combined with the ensemble classifier, are shown to significantly outperform previous art across a wide range of steganographic schemes hiding data in both domains. This work is presented as a self-contained text, covering all technical details of the ensemble classifier and the rich-model construction, their implementation, and experimental performance evaluation.
منابع مشابه
Parallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملFusion of Steganalysis Systems Using Bayesian Model Averaging
The increasing use of steganography requires digital forensic examiners to consider the extraction of hidden information from digital images encountered during investigations. The first step in extraction is to identify the embedding method. Several steganalysis systems have been developed for this purpose, but each system only identifies a subset of the available embedding methods and with var...
متن کاملAn extended feature set for blind image steganalysis in contourlet domain
The aim of image steganalysis is to detect the presence of hidden messages in stego images. We propose a blind image steganalysis method in Contourlet domain and then show that the embedding process changes statistics of Contourlet coefficients. The suspicious image is transformed into Contourlet space, and then the statistics of Contourlet subbands coefficients are extracted as features. We us...
متن کاملClassifiers and machine learning techniques for image processing and computer vision
We propose the use of classifiers and machine learning techniques to extract useful information from data sets (e.g., images) to solve important problems in Image Processing and Computer Vision. We are interested in: two and multiclass image categorization, hidden messages detection, discrimination among natural and forged images, authentication, and multi-classification. Keywords-Machine Learn...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012